Application No. 10/501,227

Reply to Office Action of April 26, 2007 

Amendments to the Drawings:

The attached replacement sheet of drawings includes changes to Fig. 9. This sheet, 
which includes Figs. 9 and 10, replaces the original sheet 6 of 13 of drawings. In Fig. 9, the 
y-axis is labeled as "Luciferase Activity" in English instead of Japanese. 






REMARKS 

Upon entry of the foregoing amendments, claims 1-16 and 18-23 will be pending in this 
application. Claims 1, 2 and 19 are the only independent claims. 

Claims 1, 2, 4, 5, 7-10, 12, 15, 16, 18-23 have been amended, without prejudice or 
disclaimer. Claim 17 is cancelled, without prejudice or disclaimer. 

It is respectfully submitted that all claim amendments made herein are fully supported by 
the claims and specification of the application as originally filed and introduce no new subject 
matter. For example, the amendment to claim 18 is at least supported by original claim 17, and 
the amendments to claims 8 and 9 are at least supported by the examples in the specification.
Entry of all of the amendments is respectfully requested.

Initially, Applicant is pleased to note that there are no prior art rejections.

Election/Restriction Requirement

The Examiner has made the previous restriction/election of species requirements final. 
Claims 20-23 are withdrawn from further consideration as being drawn to a nonelected 
invention. Claims 1-19 are under consideration to the extent they read on the elected species, 
i.e., SEQ ID NOs: 4 and 5 and their related sequences. 

The claims have been amended to cancel the non-elected species, SEQ ID NOs:23 and 
24. However, Applicant respectfully requests the Examiner to rejoin SEQ ID NOs:1 and 2 for
examination together with SEQ ID NOs: 4 and 5. The Examiner acknowledges that the elected 
species includes a nucleotide sequence encoding an amino acid sequence exhibiting 95% or more 
amino acid sequence identity to the amino acid sequence of SEQ ID NO:4. According to the 
amino acid sequence search results available at the USPTO Public PAIR website, a copy of 
which is attached to this Amendment as Appendix 1, SEQ ID NO:4 exhibits 96.2% query match 
with SEQ ID NO: 1 (see result 2 in Appendix 1). In addition, according to the nucleic acid 
sequence search results available at the USPTO Public PAIR website, a copy of which is 
attached to this Amendment as Appendix 2, SEQ ID NO:2 exhibits 95.9% query match with 
SEQ ID NO:5 (see result 2 in Appendix 2). Because of the high degree of sequence similarities 
between the amino acid sequences SEQ ID NOs:1 and 4, and their coding sequences SEQ ID
NOs:2 and 5, SEQ ID NOs:1 and 2 should be considered to be within the elected species.
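
For reference, the query match percentages cited above appear consistent with dividing each hit's alignment score by the perfect (self-match) score of the query, both of which are reported in Appendices 1 and 2. The following Python sketch reproduces the two figures under that assumption. The function name `query_match_percent` is illustrative only and is not part of the USPTO search software.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Express an alignment score as a percentage of the query's perfect score.

    Assumption: "Query Match" in the search printouts is score / perfect score.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return 100.0 * score / perfect_score


# Appendix 1 (protein search): query SEQ ID NO:1, perfect score 2672;
# result 2 (SEQ ID NO:4) has score 2571.
protein_match = round(query_match_percent(2571, 2672), 1)

# Appendix 2 (nucleic acid search): query SEQ ID NO:5 coding region,
# perfect score 1746; result 2 (SEQ ID NO:2) has score 1674.
nucleic_match = round(query_match_percent(1674, 1746), 1)

print(protein_match, nucleic_match)
```

Both values exceed the 95% threshold that the Examiner acknowledges for sequences related to the elected species.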

After entry of the claim amendment, claim 1 is directed to an isolated estrogen receptor
alpha gene, which encodes the amino acid sequence SEQ ID NO:1 and can have the DNA
sequence of SEQ ID NO:2 (see page 60, lines 5-7), and its related sequences; and estrogen 
receptor alpha 2 gene, which encodes the amino acid sequence SEQ ID NO:4 and can have the 
DNA sequence of SEQ ID NO:5 (see page 62, lines 11-13), and its related sequences. The 
estrogen receptor beta gene, which encodes the amino acid sequence SEQ ID NO:23 and can 
have the DNA sequence of SEQ ID NO:24 (see page 94, last line, to page 95, line 2), and its 
related sequences, are cancelled, without prejudice or disclaimer. 

Accordingly, examination on the merits of claims that read on SEQ ID NOs: 4 and 5, as
well as SEQ ID NOs:1 and 2, and their related sequences is respectfully requested.

Objections

The Abstract of the disclosure has been objected to because it allegedly is written in four 
paragraphs. The abstract is amended to read as a single paragraph in narrative form. 
Accordingly, withdrawal of the objection to the Abstract is respectfully requested. 

Figure 9 is objected to because the X-axis [sic; y-axis] is labeled in Japanese. A 
replacement sheet of the drawings is submitted. This replacement sheet, which includes Figs. 9 
and 10, replaces the original sheet 6 of 13 of drawings. After entry of the replacement sheet, in 
Fig. 9, the y-axis is labeled as "Luciferase Activity" in English instead of Japanese. 
Accordingly, withdrawal of the objection to Fig. 9 is respectfully requested. 

Claims 4 and 11 have been objected to because of some informalities. Claim 4 has been
amended to recite "wherein a promoter is operably linked" as suggested by the Examiner.
Claim 11 has been amended to recite "a mammalian cell" as suggested by the Examiner.
Accordingly, withdrawal of the objections to claims 4 and 11 is respectfully requested.

Claims 1, 2 and 19 have been objected to as reciting non-elected species. These claims 
have been amended to cancel recitations related to SEQ ID NO:23 or 24. As discussed above, 
SEQ ID NOs:1 and 2, although not explicitly elected, are within the elected species, and thus are
properly included in the amended claims. Accordingly, withdrawal of the objection to claims 1,
2 and 19 is respectfully requested.

Claim Rejections under 35 USC § 101

Claims 1, 2 and 17-19 are rejected under 35 USC § 101 because the claimed invention is 
allegedly directed to non-statutory subject matter. 

The rejection of claim 17 is moot in view of the cancellation of claim 17 after entry of the
claim amendment.

Claims 1, 2, 18 and 19 are amended to recite that the estrogen receptor gene, the DNA or 
the estrogen receptor is "isolated". 

Accordingly, withdrawal of the rejection to claims 1, 2 and 17-19 under 35 USC § 101 is 
respectfully requested. 

Claim Rejections under 35 USC § 112, Second Paragraph

Claims 5, 6, 8-14 and 16-18 are rejected under 35 USC § 112, second paragraph, as being 
indefinite. 

The Examiner asserts that claims 5 and 6 are vague and indefinite, because it is unclear 
how a viral particle can contain a virus. 

Claim 5 is amended to recite that the vector is a "viral vector". The term "viral vector" 
has a definite meaning in the art, i.e., the genetic material of a virus that has been modified to 
serve as a vector to deliver genetic materials into a cell of interest in recombinant technology. 
Applicant respectfully submits that after entry of the claim amendments, claims 5 and 6 are 
definite. 

The Examiner asserts that claim 16 is incomplete, because it is unclear what method step 
is intended by the recitation of "producing estrogen receptor." 

Claim 16 has been amended to recite "thereby producing the estrogen receptor." After 
entry of the claim amendment, "producing the estrogen receptor" is no longer a method step, but 
a desired result from the culturing step. Applicant respectfully submits that the amended claim 
16 is complete and definite. 

The Examiner asserts that claims 8, 9 and 10 are vague and indefinite because these 
claims are directed to transformants, but also recite method steps, and that it is unclear whether 
these claims are directed to products or methods. The Examiner further asserts that, for claims
8-12, it is unclear if Applicant intends isolated cells or cells transformed in vivo. Claims 11-14 are
rejected because they depend from the rejected claim 8.

Claims 8 and 9 have been amended to recite that the transformant is "produced by
introducing . . . into a host cell, wherein the host cell is isolated or cultured in vitro." Claim 10
has been amended to recite the transformant of claim 8 "comprising the estrogen receptor gene in
a chromosome of the host cell." Applicant respectfully submits that after entry of the claim
amendments, claims 8-12 are definite, thus claims 11-14 are also definite because they no longer
depend from allegedly indefinite claim 8.

The Examiner asserts that claim 17 is vague and indefinite in reciting "partial nucleotide
sequence," because the specification has not provided a clear definition of this term. Claim 18 is
also rejected because it depends from the rejected claim 17.

Without acquiescing to the Examiner's reasoning, claim 17 is cancelled. The cancellation 
renders the above rejection to claims 17 and 18 moot. 

Accordingly, reconsideration and withdrawal of all rejections of claims under 35 USC
§ 112, second paragraph, are respectfully requested.

Claim Rejections under 35 USC § 112, First Paragraph

Claims 17 and 18 are rejected under 35 USC § 112, first paragraph, as failing to comply
with the enablement requirement.

Without acquiescing to the Examiner's reasoning, claim 17 is cancelled. The cancellation 
renders the above rejection to claim 17 moot. 

Regarding claim 18, the Examiner asserts that the specification does not teach any 
limiting structure, rather it teaches one exemplary sequence, and that art of record does not make 
up for the deficiencies in the specification in specifically delineating which nucleotides encode 
the estrogen (ligand) binding domain (LBD). Applicant respectfully disagrees. 

The specification describes the LBD in sufficient detail which enables a person skilled in 
the art to make and use the LBD. In addition to the description about LBD as pointed out by the 
Examiner in the Office Action, the specification also provides specific examples on how to make 
and use a LBD from both an estrogen receptor alpha claimed in the present claims (Example 5, 
page 74, at line 23, to page 75, line 7), and a LBD of an estrogen receptor beta claimed in the 
present claims (Example 16, page 106, line 13, to page 108, line 3). Given the high degree of 
sequence similarities between estrogen receptor alpha (SEQ ID NO:1) and estrogen receptor
alpha 2 (SEQ ID NO:4), one skilled in the art would readily know how to make and use the LBD 
of estrogen receptor alpha 2 based on the teaching of the present application. 

In addition, at the time of the invention, LBDs for many estrogen receptors had been
identified and studied. Structural and sequence homologies had been observed among the LBDs.
Ligands for estrogen receptors were readily available. Various in vivo or in vitro techniques
could be used for ligand binding assays to measure the binding of a ligand to a LBD. In fact,
several years before the present application was filed, the June 1999 version of the
SWISS-PROT/TrEMBL database already contained about 30 estrogen receptor LBD sequences from
different species, omitting those with only partially determined sequences (see Mueller-Fahrnow
et al., Current Opinion in Biotechnology, 1999, 10:550-556, hereinafter "1999 review paper,"
which accompanies this Amendment for reference). Crystal structures for a number of estrogen
receptors had already been determined by then. According to the 1999 review paper, the
interspecies homology for the LBD is much greater than the similarity between the overall
sequences of alpha and beta receptor subtypes (see page 550, 2nd column, 3rd paragraph). At the
time of the invention, the state of the art was such that once the sequence of an estrogen receptor
was known, one of ordinary skill in the art could readily predict the LBD of the estrogen receptor.

Claim 18 is directed to the LBD of an estrogen receptor gene of claim 1, which includes 
the estrogen receptor alpha and alpha 2 genes, and their related sequences. In view of the 
teaching of the present application, particularly in view of the sequences for the estrogen 
receptor alpha and alpha 2 genes provided in the present application, e.g., SEQ ID NOs:1 and 4
for the amino acid sequences, and SEQ ID NOs: 2 and 5 for the coding sequences, sequence 
alignment of the present estrogen receptor alpha and alpha 2 genes with other known estrogen 
receptor genes can be readily performed. The LBD sequences can be readily predicted based on 
the sequence alignment. Proteins containing the predicted LBD sequences can be readily made 
by DNA recombinant technology. The LBD functions of such proteins can be readily tested in 
vivo or in vitro. Therefore, a partial nucleotide sequence that encodes for a LBD of an estrogen 
receptor of the present application is enabled. 

Accordingly, reconsideration and withdrawal of the rejections of claims 17 and 18 under 
35 USC § 112, first paragraph, as failing to comply with the enablement requirement, are
respectfully requested. 

CONCLUSION 

Reconsideration and withdrawal of all of the objections and rejections are respectfully 
solicited. 

Applicant believes that the present application is now in condition for allowance. Upon 
finding composition claims allowable, Applicant respectfully requests the Examiner to rejoin 
method claims that include all of the limitations of the allowable composition claims. A Notice
of Allowance with respect to all elected composition claims and the rejoined method claims is 
respectfully solicited. 

The Examiner is invited to contact the undersigned attorney for any reason to advance the 
prosecution of this application. 



Respectfully submitted, 



KOICHI SAITO 

(Date)                By: ALAN S. NADEL

Registration No. 27,363 

AKIN GUMP STRAUSS HAUER & FELD LLP 

One Commerce Square 
2005 Market Street, Suite 2200 
Philadelphia, PA 19103-7013 
Telephone: 215-965-1200 
Direct Dial: 215-965-1280 
Facsimile: 215-965-1210 
E-Mail: anadel@akingump.com 

ASN/WH/hg 
Enclosures 
OM protein - protein search, using sw model

Search time 50 Seconds
(without alignments)
1887.708 Million cell updates/sec

Title:          US-10-501-227-1
Perfect score:  2672
Sequence:       1 MSLKDWLLGKERTVVTMEEL ... PTGPGVLQYGGSRSDCTHIL 506

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5



Searched: 800936 seqs, 185091930 residues 

Total number of hits satisfying chosen parameters: 800936 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA_New: * 

1: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
8: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US60_NEW_PUB.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed 
and is derived by analysis of the total score distribution. 

ALIGNMENTS



RESULT 1
US-10-501-227-1
; Sequence 1, Application US/10501227
; Publication No. US20060141560A1
; GENERAL INFORMATION:
; APPLICANT: SAITO, Koichi
; TITLE OF INVENTION: Estrogen receptor genes and utilization thereof
; FILE REFERENCE: 600630-21US (561334)
; CURRENT APPLICATION NUMBER: US/10/501,227
; CURRENT FILING DATE: 2004-07-12
; PRIOR APPLICATION NUMBER: JP 2002-004395
; PRIOR FILING DATE: 2002-01-11
; NUMBER OF SEQ ID NOS: 48
; SEQ ID NO 1
; LENGTH: 506
; TYPE: PRT
; ORGANISM: Blue Gill
US-10-501-227-1

Query Match 100.0%; Score 2672; DB 6; Length 506;
Best Local Similarity 100.0%; Pred. No. 4.4e-215;
Matches 506; Conservative 0; Mismatches 0; Indels 0; Gaps
QV 1 MSLKDWLLGKERTVVTMEELRSSVPSSQQPVPREDQCATSDESYSVGESGAGARGFEMAK 6 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 m 1 [ 1 [ 1 1 1 1 1 11 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 im 

Db 1 MSLKDWLLGKERTWTMEELRSSVPSSCK!PWREDQCA € 

Qy 61 EMRFCAVCSDYASGraW 1 


SE ° id ii 1 11 11 1 1 mi 11 minium 

Db 121 RLRKCYEVGMMKGGVRKDRGRVLRRDKRRAGTNDREKASKDLEYKTVPPQDRRKHSSSSS li 

QY 181 AGGGGGKSSVTGMSPDQVLLLLQGAEPPMLCSRQKLSRPYTEVTIMTLLTSMADKELVHM 2- 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

181 AGGGGGKSSVTGMSPDQVLLLLQGAEPPMLCSRQKLSRPYTEVTIMTLLTSMADKELVHM 240 
241 ITWAKKLPGFLQLSLHDQVQLLESSWLEVLMIGLIWRSIHCPGKLIFAQDLILDRNEGDC 300 

llllllllllilllllllll IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMI 

241 ITWAKKLPGFLQLSLHDQVQLLESSWLEVLMIGLIWRSIHCPGKLIFAQDLILDRNEGDC 300 
301 VEGFVEIFDMLLATASRFRMLKLKPEEFVCLKA1ILLNSGAFSFCTGTMEPLHNSMAVQN 360 

iiMiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiMimimiiiiiiiimi! 

301 VEGFVEIFDMLLATASRFRMLKLKPEEFVCLKAIILLNSGAFSFCTGTMEPLHNSMAVQN 360 
361 MLDTITDALIHHISQSGCSAQQQSRRQAQLLLLLSHIRHMSNKGMEHLYSMKCKNKVPLY 420 

iiiniiiiii Miiiiiiiiiiiiii """"""" 'iijiii'i 

361 MLDTITDALIHHISQSGCSAQQQSRRQAQLLLLLSHIRHMSNKGMEHLYSMKCKNKVPLY 420 
421 DLLLEMLDAHRIHRPDRPAQFWSQADGEPPFINNNNSSNSGSNGGVSSSVGSSSGPRVNH 480 

I!III11III1II1IIIII!II!I!I!1IIIIIIIIMI!1II1IIIMII1IIIMI!I 

421 DLLLEMLDAHRIHRPDRPAQFWSQADGEPPFINNNNSSNSGSNGGVSSSVGSSSGPRVNH 480 

481 ESPSRGPTGPGVLQYGGSRSDCTHIL 506 
llllllllllllllllilllllllll 



RESULT 2
US-10-501-227-4
; Sequence 4, Application US/10501227
; Publication No. US20060141560A1
; GENERAL INFORMATION:
; APPLICANT: SAITO, Koichi
; TITLE OF INVENTION: Estrogen receptor genes and utilization thereof
; FILE REFERENCE: 600630-21US (561334)
; CURRENT APPLICATION NUMBER: US/10/501,227
; CURRENT FILING DATE: 2004-07-12
; PRIOR APPLICATION NUMBER: JP 2002-004395
; PRIOR FILING DATE: 2002-01-11
; NUMBER OF SEQ ID NOS: 48
; SEQ ID NO 4
; LENGTH: 582
; TYPE: PRT
; ORGANISM: Blue Gill
US-10-501-227-4

Query Match 96.2%; Score 2571; DB 6; Length 582;
Best Local Similarity 100.0%; Pred. No. 1.5e-206;
Matches 486; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
Qy 21 RSSVPSSQQPVPREDQCATSDESYSVGESGAGARGFEMAKEMRFCAVCSDYASGYHYGVW 80 

lllljlllllllllllllllllllllllllllllllllllllllMllllllllllllil 

Db 97 RSSVPSSQQPVPREDQCATSDESYSVGESGAGARGFEMAKEMRFCAVCSDYASGYHYGVW 156 

Qy 81 SCEGCKAFFKRSIQGHNDYMCPATNQCTIDRNRRKSCQACRLRKCYEVGMMKGGVRKDRG 140 

iiniiiiiiiiiiiiiniiiiiiniiimiMMiiiniiiimimiiiMi 

Db 157 SCEGCKAFFKRSIQGHNDYMCPATNQCTIDKNRRKSCQACRLRKCYEVGMMKGGVRKDRG 216 

Qy 141 RVLRRDKRRAGTNDREKASKDLEYKTVPPQDRRKHSSSSSAGGGGGKSSVTGMSPDQVLL 200 

jmiiiiiiiiiiiiiiiiiiiimiimmmimimiiiiiiMMiii! 

Db 217 RVLRRDKRRAGTNDREKASKDLEYKTVPPQDRRKHSSSSSAGGGGGKSSVTGMSPDQVLL 276 

Qy 201 LLQGAEPPMLCSRQKLSRPYTEVTIMTLLTSMADKELVHMITWAKKLPGFLQLSLHDQVQ 260

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 277 LLQGAEPPMLCSRQKLSRPYTEVTIMTLLTSMADKELVHMITWAKKLPGFLQLSLHDQVQ 336

Qy 261 LLESSWLEVLMIGLIWRSIHCPGKLIFAQDLILDRNEGDCVEGFVEIFDMLLATASRFRM 320

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

Db 337 LLESSWLEVLMIGLIWRSIHCPGKLIFAQDLILDRNEGDCVEGFVEIFDMLLATASRFRM 396 

Qy 321 LKLKPEEFVCLKAIILLNSGAFSFCTGTMEPLHNSMAVQNMLDTITDALIHHISQSGCSA 380 

iiiiimiiiiiiiiiiiimiiiiimiimiiiii iiimimi 

Db 397 LKLKPEEFVCLKAIILLNSGAFSFCTGTMEPLHNSMAVQNMLDTITDALIHHISQSGCSA 456 

Qy 381 QQQSRRQAQLLLLLSHIRHMSNKGMEHLYSMKCKNKVPLYDLLLEMLDAHRIHRPDRPAQ 440

M I i 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 i 1 [ I M 1 1 M 1 1 1 1 1 1 1 1 H 1 1 1 i ! M 1 1! I M I M 

Db 457 QQQSRRQAQLLLLLSHIRHMSNKGMEHLYSMKCKNKVPLYDLLLEMLDAHRIHRPDRPAQ 516 



Qy 501 DCTHIL 506 

mm 

Db 577 DCTHIL 582 

RESULT 3
US-11-355-544-31
; Sequence 31, Application US/11355544
; Publication No. US20060183159A1
; GENERAL INFORMATION:
; APPLICANT: Zhao, Huimin
; APPLICANT: Chen, Zhilei
; TITLE OF INVENTION: METHOD FOR ENGINEERING A PROTEIN BY IN VITRO COEVOLUTION
; FILE REFERENCE: IL0013US
; CURRENT APPLICATION NUMBER: US/11/355,544
; CURRENT FILING DATE: 2006-02-16
; PRIOR APPLICATION NUMBER: US 60/654,269
; PRIOR FILING DATE: 2005-02-17
; NUMBER OF SEQ ID NOS: 46
; SOFTWARE: PatentIn version 3.3
; SEQ ID NO 31
; LENGTH: 627
; TYPE: PRT
; ORGANISM: Micropterus salmoides
US-11-355-544-31

Query Match 92.4%; Score 2469; DB 7; Length 627;
Best Local Similarity 96.9%; Pred. No. 5.7e-198;
Matches 471; Conservative 4; Mismatches 9; Indels 2; Gaps 2;

Qy 21 RSSVPSSQQPVPREDQCATSDESYSVGESGAGARGFEMAKEMRFCAVCSDYASGYHYGVW 80 
United States Patent & Trademark Office

10/501,227 Estrogen receptor genes and utilization thereof

Supplemental Content - Search Results

This page gives you information about the number of versions associated with the application you request.
Use this page to obtain specific version information.

Version Number   Item ID            Item Size
1                09323b67800d7a2e   147.879
1                09323b67800d7a31   147.879
1                09323b67800d7a34   147.879
1                09323b67800d7a37   147.879
1                09323b67800d7a3a   147.879
1                09323b67800d7a3e   147.879
1                09323b67800d7a41   147.879
1                09323b67800d7a44   147.879
1                09323b67800d7a47   147.879
1                09323b67800d7a4a   147.879
1                09323b67800d7a4d   147.879
1                09323b67800d7a50   147.879
1                09323b67800d7a54   147.879
1                09323b67800d7a57   147.879
1                09323b67800d7a5a   147.879
1                09323b67800d7a5d   147.879
1                09323b67800d7a61   147.879
1                09323b67800d7a64   147.879
1                09323b67800d9046   147.879
1                09323b67800d9049   147.879
1                09323b67800d904e   147.879
1                09323b67800d9051   147.879
1                09323b67800d9056   147.879
1                09323b67800d905b   147.879
1                09323b67800d9065   147.879
1                09323b67800d9067   147.879
1                09323b67800d906c   147.879
1                09323b67800d906f   147.879
1                09323b67800d9072   147.879
1                09323b67800d9075   147.879
1                09323b67800d9079   147.879
1                09323b67800d907c   147.879
1                09323b67800d9082   147.879
1                09323b67800d9a72   147.879
1                09323b67800d9a77   147.879
1                09323b67800d9a7c   147.879
GenCore version 6.2

Copyright (c) 1993 - 2007 Biocceleration Ltd.

OM nucleic - nucleic search, using sw model

Run on: March 11, 2007, 15:24:15 ; Search time 1219 Seconds
(without alignments)
8479.367 Million cell updates/sec

Title:          US-10-501-227-5_COPY_74_1819
Perfect score:  1746
Sequence:       1 atgtaccctgaagagagcag ... ctgactgcacccacatccta 1746

Searched: 11113607 seqs, 2959867085 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Listing first 45 summaries

Database : Published_Applications_NA_New:*

1: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_NEW_PUB.seq:*
2: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US06_NEW_PUB.seq:*
3: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_NEW_PUB.seq:*
4: /EMC_Celerra_SIDS3/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
5: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09_NEW_PUB.seq:*
6: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq:*
7: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq1:*
8: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq2:*
9: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq3:*
10: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq4:*
11: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq5:*
12: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq6:*
13: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq:*
14: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq1:*
15: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq2:*
16: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq3:*
17: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq4:*
18: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US60_NEW_PUB.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

Result            Query
   No.    Score   Match %  Length  DB  ID                      Description
     1    1746    100.0    1824    11  US-10-501-227-5         Sequence 5, Appli
     2    1674     95.9    3499    11  US-10-501-227-2         Sequence 2, Appli
     3    988.6    56.6     996    11  US-10-501-227-3         Sequence 3, Appli
     4    482.6    27.6    1788    16  US-11-355-544-1         Sequence 1, Appli
     5    482.6    27.6    1788    16  US-11-368-891-1         Sequence 1, Appli
     6    481      27.5    1788    16  US-11-355-544-15        Sequence 15, Appl
     7    481      27.5    6450     8  US-10-940-774-18        Sequence 18, Appl
     8    481      27.5    6450    11  US-10-515-675A-47       Sequence 47, Appl
     9    481      27.5    6450    11  US-10-517-155A-25       [illegible]
    10    481      27.5    6450    14  US-11-289-102-120       [illegible]
    11    481      27.5    [illegible]  US-11-283-329-91       [illegible]
    12    481      27.5    6450    15  US-11-346-759-100       [illegible]
    13    479.4    27.5    1788    16  US-11-355-544-7         [illegible]
    14    477.8    27.4    1788    16  US-11-355-544-9         [illegible]
    15    476.2    27.3    [illegible]  US-11-355-544-11       [illegible]
    16    474.6    27.2    1788    16  US-11-355-544-13        Sequence 13, Appl
    17    428.4    24.5    2186    11  US-10-501-227-24        [illegible]
    18    389.8    22.3    1593    16  US-11-355-544-5         Sequence 5, Appli
    19    389.8    22.3    1593    16  US-11-368-891-5         Sequence 5, Appli
    20    389.8    22.3    1745     8  US-10-940-774-1563      Sequence 1563, Ap
    21    389.8    22.3    2011    14  US-11-283-329-93        Sequence 93, Appl
    22    389.8    22.3    2011    15  US-11-346-759-101       Sequence 101, App
    23    361      20.7     882    16  US-11-355-544-3         Sequence 3, Appli
    24    361      20.7     882    16  US-11-368-891-3         Sequence 3, Appli
    25    353      20.2    6039    11  US-10-541-683-6         Sequence 6, Appli
    26    353      20.2   14411    11  US-10-541-683-7         Sequence 7, Appli
    27    346.2    19.8    2739    16  US-11-280-456-39        Sequence 39, Appl
  c 28    346.2    19.8    2739    16  US-11-260-456-44        Sequence 44, Appl
    29    346.2    19.8    2775    16  US-11-280-456-41        Sequence 41, Appl
  c 30    346.2    19.8    2775    16  US-11-280-456-43        Sequence 43, Appl
    31    340.8    19.5     963    11  US-10-501-227-25        Sequence 25, Appl
    32    299.6    17.2    1000    14  US-11-266-748A-395231   Sequence 395231,
  c 33    299.6    17.2    1000    14  US-11-266-748A-466277   Sequence 466277,
    34    292      16.7    5001     8  US-10-940-774-1495      Sequence 1495, Ap
    35    240      13.7     553    11  US-10-834-268-2626      Sequence 2626, Ap
    36    234.4    13.4     751    11  US-10-956-160-3191      Sequence 3191, Ap
    37    [row not legible in source]
    38    215      12.3     611    14  US-11-266-748A-214650   Sequence 214650,
    39    191.8    11.0     600    11  US-10-956-160-213298    Sequence 213298,
    40    173.4     9.9     600    11  US-10-956-160-213297    Sequence 213297,
    41    173.4     9.9     748    11  US-10-956-160-3190      Sequence 3190, Ap
    42    133.6     7.7    1503    11  US-10-526-940-7         Sequence 7, Appli
    43    132.6     7.6    1566    11  US-10-526-940-2         Sequence 2, Appli
    44    132.6     7.6    2402    14  US-11-266-748A-32133    Sequence 32133, A
    45    132       7.6    2193    14  US-11-283-329-97        Sequence 97, Appl



RESULT 1
US-10-501-227-5
; Sequence 5, Application US/10501227
; Publication No. US20060141560A1
; GENERAL INFORMATION:
; APPLICANT: SAITO, Koichi
; TITLE OF INVENTION: Estrogen receptor genes and utilization thereof
; FILE REFERENCE: 600630-21US (561334)
; CURRENT APPLICATION NUMBER: US/10/501,227
; CURRENT FILING DATE: 2004-07-12
; PRIOR APPLICATION NUMBER: JP 2002-004395
; PRIOR FILING DATE: 2002-01-11
; NUMBER OF SEQ ID NOS: 48
; SEQ ID NO 5
; LENGTH: 1824
; TYPE: DNA
; ORGANISM: Blue Gill
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (74)...(1822)
US-10-501-227-5

Query Match 100.0%; Score 1746; DB 11;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1746; Conservative 0; Mismatches 0;


iMiuiiiiimiiiiiiiiiiiiiMiiiimnmiiiiiiM^ 

74 ATGTACCCTGAAGAGAGCAGGGGGTCCGGAGGGGTAGCCACTGTGGACTTTCTGGAAGGG 133 
61 ACCTACGATTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTACAGCCAGTCTGGCTAC 120 

INIl!NlllllllillllllN!lll!MillMil!lNllllllllMUj.J.'iil 10 , 

134 ACCTACGATTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTACAGCCAGTCTGGCTAC 193 
121 TACTCTGTACCTCTGGACGCCCAAGGGCCACCCTCAGATGGCAGCCTTCAGTCCCTGGGC 180 

imiiiiiiiiiiiiiiiiiiiiiiiiiiiMmiiiiiniiniimm 

194 TACTCTGTACCTCTGGACGCCCAAGGGCCACCCTCAGATGGCAGCCTTCAGTCCCTGGGC 253 
181 AGCGGGCCTACCAGTCCTCTTGTGTTTGTGCCGTCCAGCCCCAGACTCAGCCCCTTTATG 240

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

254 AGCGGGCCTACCAGTCCTCTTGTGTTTGTGCCGTCCAGCCCCAGACTCAGCCCCTTTATG 313
241 CACCCGCCCAGCCACCACTATCTGGAAACCACCTCAACACCCGTCTACAGGTCTAGTGTC 300

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

314 CACCCGCCCAGCCACCACTATCTGGAAACCACCTCAACACCCGTCTACAGGTCTAGTGTC 373

301 CCATCCAGCCAGCAGCCAGTTCCCAGAGAGGACCAGTGTGCCACCAGTGATGAGTCCTAT 360 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
374 CCATCCAGCCAGCAGCCAGTTCCCAGAGAGGACCAGTGTGCCACCAGTGATGAGTCCTAT 433 

361 AGTGTGGGGGAGTCAGGGGCTGGAGCCAGGGGGTTTGAGATGGCCAAGGAGATGCGTTTC 420 

1 1 1 1 1 1 i 1 1 1 1 l ] 1 1 1 1 1 1 1 1 1 i iiiiiiiniiiiniiiiiiii 

434 AGTGTGGGGGAGTCAGGGGCTGGAGCCAGGGGGTTTGAGATGGCCAAGGAGATGCGTTTC 493 
421 TGTGCTGTGTGCAGTGACTATGCCTCTGGGTACCACTACGGGGTGTGGTCCTGTGAAGGC 480

1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 111 1 1 1 1 1 1 

494 TGTGCTGTGTGCAGTGACTATGCCTCTGGGTACCACTACGGGGTGTGGTCCTGTGAAGGC 553 
481 TGTAAGGCCTTCTTTAAGAGGAGCATCCAGGGTCACAATGACTATATGTGCCCAGCAACC 540 

iiiniiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiitiiiiimi 

554 TGTAAGGCCTTCTTTAAGAGGAGCATCCAGGGTCACAATGACTATATGTGCCCAGCAACC 613 
541 AATCAGTGTACTATTGACAGGAATCGGAGAAAGAGCTGCCAGGCTTGCCGTCTTAGGAAG 600 

Mil I MM IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

614 AATCAGTGTACTATTGACAGGAATCGGAGAAAGAGCTGCCAGGCTTGCCGTCTTAGGAAG 673 
601 TGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGCCGTGTTTTGCGC 660 

iniiMiiiiiiiiiMimiiimiiiimiiimiiiimiiiiMiinii 

674 TGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGCCGTGTTTTGCGC 733 
661 CGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGTAC 720 

lllllllllllllllllllllillllMMIIIIIIIIIIllllMlllllinilllll 

734 CGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGTAC 793 
721 AAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTGGA 780 
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iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiii 

794 AAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTGGA 853 
781 GGAGGAAAATCATCAGTGACCGGGATGTCTCCTGACCAGGTGCTCCTCCTGCTCCAGGGT 840 
854 GGAGGAAUiciicA^ 913 
841 GCCGAGCCCCCAATGCTGTGCTCCCGTCAGAAGCTGAGCCGACCGTACACCGAGGTCACC 900 

miiiiiimiiiiiimiMi imimmhmmmmimmimm 

914 GCCGAGCCCCCAATGCTGTGCTCCCGTCAGAAGCTGAGCCGACCGTACACCGAGGTCACC 973 
901 ATAATGACACTACTCACCAGCATGGCCGATAAGGAGCTGGTCCACATGATCACCTGGGCC 960 

iMiiiiiiiiiMimiiiiiiiimmMiiiiiMimiiMiiimmii 

974 ATAATGACACTACTCACCAGCATGGCCGATAAGGAGCTGGTCCACATGATCACCTGGGCC 1033 
961 AAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAGCTGCTGGAGAGC 1020 

mi iiiiiiiii iniimiiiiiiiiiiiMiiiiiiiiMiMii 

1034 AAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAGCTGCTGGAGAGC 1093 

1021 TCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCAAA 1080 

I I | I I IGAGGAAAATCATCAGTGACCGGGATGTCTCCTGACCAGGTGCTCCTCCTGCTCCAGGGT 840 

in inn in iiiiiii iiiiiii iiiiiiiiinin 1 1 nun iim tiiiiM i 

854|]|||||||!||||||11|||||||||||11|| Illllllllllllll 

1094 TCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCAAA 1153 

1081 CTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCTTT 1140 

MMMMMMIIMIIIIIIMIMIMMIMIMIIMIMIMMMIMMMI 

1154 CTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCTTT 1213 
1141 GTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATGCTCAAACTCAAA 1200 

MMMMIMMIMMMMIMIMIMIMMMIMMMMIMIIMMIIM 

1214 GTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATGCTCAAACTCAAA 1273 
1201 CCTGAGGAGTTTGTCTGCCTCAAAGCTATCATCCTGCTCAACTCTGGTGCCTTCTCTTTC 1260 

jiiiiiuiiiiiiiMiiiMMiiiiiiiMimimiiiiiiiiimniini 

1274 CCTGAGGAGTTTGTCTGCCTCAAAGCTATCATCCTGCTCAACTCTGGTGCCTTCTCTTTC 1333 
1261 TGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGCAGTGCAGAACATGCTGGACACC 1320 

MMIIMMIMIIMIMMIIIMIMMMMIIIMMIIMIMM'IMMIM 

1334 TGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGCAGTGCAGAACATGCTGGACACC 1393 
1321 ATCACAGACGCTCTCATACATCATATCAGCCAATCAGGATGCTCGGCTCAGCAGCAGTCG 1380 

II I M Ml 1 1 [MIIIM Hill!] Illllllllllilllll 1 1 1 1 Ml 1 1 1 M I M I II 

1394 ATCACAGACGCTCTCATACATCATATCAGCCAATCAGGATGCTCGGCTCAGCAGCAGTCG 1453 

1381 AGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAGGC 1440 
| | | | | | | | | | | | | | | | | | | | | | [ | | | | | | [ | | | | | | | | | | || || | | | | | ICAAACTCAAA 1200 

I [ I i 1 1 1 1 1 L 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ii 1 1 1 1 1 1 1 k 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1214 GTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATGCTCAAACTCAAA 1273 
1201 CCTGAGGAGTTTGTCTGCCTCAAAGCTATCATCCTGCTCAACTCTGGTGCCTTCTCTTTC 1260 
1274 CclGAGGAGUUicUcci^ 1333 
1261 TGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGCAGTGCAGAACATGCTGGACACC 1320 

I II 1 1 II II M I M M 1 1 1 1 M I II 1 1 1 M M II M 1 1 1 1 M 1 1 1 1 M 1 1 1 II I M 1 1 M 

1334 TGCACCGGCAC j j j | | j | | | | 

1454 AGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAGGC 1513 
1441 ATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGCTG 1500 

L 1 1 1 1 ! 1 1 II I M M 1 1 11 1 1 1 1 1 1 1 1 E 1 1 ! 1 1 1 1 1 1 i 1 1 ) 1 1 M M 1 1 1 M 1 1 1 1 L I ! I 

1514 ATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGCTG 1573 
1501 GAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAGTTCTGGTCCCAG 1560 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 J I i MM 

1574 GAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAGTTCTGGTCCCAG 1633 
1561 GCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGTGGCAGCAATGGC 1620 

. m 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 m i m i m 1 1 1 n n 1 1 1 1 m 1 1 1 1 n 1 1 1 ii i 1 1 1 1 1 1 

1634 GCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGTGGCAGCAATGGC 1693 
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1621 GGCGTCTCCTCTTCAGTCGGTTCCAGTTC AGGACCCCGAGTCAACCACGAGAGCCCGAGC 1680 

llllllllllllllllllllllllllllllllillillilllllllilMIMIIIIIII 

1694 GGCGTCTCCTCTTCAGTCGGTTCCAGTTCAGGACCCCGAGTCAACCACGAGAGCCCGAGC 1753 
1681 AGAGGACCCACAGGTCCAGGAGTCCTGCAGTACGGAGGGTCCCGCTCTGACTGCACCCAC 1740 

iiiiiiimiiiiimiimiumimmmimmimiiiimiiiimmmimi . 
1754 AGAGGACCCACAGGTCCAGGAGTCCTGCAGTACGGAGGGTCCCGCTCTGACTGCACCCAC 1813

1741 ATCCTA 1746 

llllll 
1814 ATCCTA 1819 



US-10-501-227-2 

; Sequence 2, Application US/10501227 

; Publication No. US20060141560A1 

, iNFORHiiiillHIIlllllllllllllNIIIIIMMIIMIIIlllll 



Qy 1561 ( 



I t I I I 1 1 I I I I I I I I I I I I I I I I I I 1 I t I II I I Mil II IIUiH'iiiiiiiUiiic 1693 
Db 1634 GCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGTGGCAGCAATGGC 1693 

OV 1621 GGCGTCTCCTCTTCAGTCGGTTCCAGTTCAGGACCCCGAGTCAACCACGAGAGCCCGAGC 1680 

1 1 1 1 M I M I L 11 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 JL J, I JL ' JLJLJL ' 'i ' ' 

Db 1694 GGCGTCTCCTCTTCAGTCGGTTCCAGTTCAGGACCCCGAGTCAACCACGAGAGCCATION : 

APPLICANT: SAITO, Koichi 

TITLE OF INVENTION: Estrogen receptor genes and utilization thereof 
FILE REFERENCE: 600630-21US (561334) 
CURRENT APPLICATION NUMBER : US/10/501,227 
CURRENT FILING DATE: 2004-07-12 
PRIOR APPLICATION NUMBER: JP 2002-004395 
PRIOR FILING DATE: 2002-01-11 
NUMBER OF SEQ ID NOS: 48 
SEQ ID NO 2 
LENGTH: 3499 
TYPE: DNA 

ORGANISM: Blue Gill 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (424)...(1944)
US-10-501-227-2 



Query Match 95.9%; Score 1674; DB 11; Length 3499;
Best Local Similarity 96.6%; Pred. No. 0;
Matches 1746; Conservative 0; Mismatches 0; Indels 62;
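
The percentages in this summary are consistent with two simple ratios. The sketch below is a reading of the numbers, not a description of how the search program is documented to work: it assumes that best local similarity is matches divided by all aligned columns (matches, mismatches and indels) and that query match is the hit score divided by the query's self-score (1746, the score of the 100% hit in Result 1). The function names are hypothetical.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percentage of aligned columns that are identical (assumed definition)."""
    columns = matches + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * matches / columns


def query_match(score: float, self_score: float) -> float:
    """Hit score as a percentage of the query's self-score (assumed definition)."""
    if self_score <= 0:
        raise ValueError("self-score must be positive")
    return 100.0 * score / self_score


# Values reported for SEQ ID NO 2.
similarity = best_local_similarity(matches=1746, mismatches=0, indels=62)
match = query_match(score=1674, self_score=1746)

print(round(similarity, 1), round(match, 1))
```

Rounded to one decimal place, both ratios reproduce the values printed in the summary, which supports the assumed definitions without proving them.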



1 ATGTACCCTGAAGAGAGCAGGGGGTCCGGAGGGGTAGCCACTGTGGACTTTCTGGAAGGG 60 

! I L I ! 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M L M M 1 1 1 1 1 1 1 ) I L 1 M M I ! I M 1 L 1 1 1 1 1 1 1 L 

134 ATGTACCCTGAAGAGAGCAGGGGGTCCGGAGGGGTAGCCACTGTGGACTTTCTGGAAGGG 193 
61 ACCTACGATTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTACAGCCAGTCTGGCTAC 120 

M M 1 1 1 E 1 M M I M 1 1 1 1 f 1 1 U U t M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I ! ! 

194 ACCTACGATTATGCCGCCCCCACCCCTGCCCCGACTCCTCTTTACAGCCAGTCTGGCTAC 253 
121 TACTCTGTACCTCTGGACGCCCAAGGGCCACCCTCAGATGGCAGCCTTCAGTCCCTGGGC 180 

M 1 1 1 1 1 M 1 1 1 1 1 M ! M 1 1 1 11 1 1 1 1 1 1 1 M 1 1 U tt i M U 1 1 1 1 II 1 1 1 1 M 1 1 1 1 

254 TACTCTGTACCTCTGGACGCCCAAGGGCCACCCTCAGATGGCAGCCTTCAGTCCCTGGGC 313 



I CACCCGCCCAGCCACCACTATCTGGAAACCACCTCAACACCCGTCTAC 288 

MIMIIMIIMMMIIHIIlMIUtllMMmiMMMII 

. CACCCG^CCAGCCACCACTATCTGGAAACCACCTCAACACCCGTCTACAGATGAGTCTGA 433 

, ___ : AGGTCTAGTG 298 

I II I I III II 

1 AAGACTGGTTATTAGGAAAAGAAAGGACGGTGGTGACCATGGAGGAGCTGAGGTCTAGTG 493 
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299 TCCCATCCAGCCAGCAGCCAGTTCCCAGAGAGGACCAGTGTGCCACCAGTGATGAGTCCT 358 

iiiiiiiiiiiiiiiiiiiiiii iiiiiii i Miiiiiiiiiiiiiiiinm „, 

494 TCCCATCCAGCCAGCAGCCAGTTCCCAGAGAGGACCAGTGTGCCACCAGTGATGAGTCCT 553 
359 ATAGTGTGGGGGAGTCAGGGGCTGGAGCCAGGGGGTTTGAGATGGCCAAGGAGATGCGTT 418 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIII1IIIIIIIMII 

554 ATAGTGTGGGGGAGTCAGGGGCTGGAGCCAGGGGGTTTGAGATGGCCAAGGAGATGCGTT 613 
419 TCTGTGCTGTGTGCAGTGACTATGCCTCTGGGTACCACTACGGGGTGTGGTCCTGTGAAG 478 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIUIIIIIIIII 

614 TCTGTGCTGTGTGCAGTGACTATGCCTCTGGGTACCACTACGGGGTGTGGTCCTGTGAAG 673
479 GCTGTAAGGCCTTCTTTAAGAGGAGCATCCAGGGTCACAATGACTATATGTGCCCAGCAA 538 

INMININIIIIIIIIIIIIilllinillllllllllllllll---jjjjjjjjj AGGTCTAGTG 298 
434 aagactggttattaggaaaagaaaggacggtggtga| 1 1 1 1 1 1 1 1 I I I I 

674 GCTGTAAGGCCTTCTTTAAGAGGAGCATCCAGGGTCACAATGACTATATGTGCCCAGCAA 733 
539 CCAATCAGTGTACTATTGACAGGAATCGGAGAAAGAGCTGCCAGGCTTGCCGTCTTAGGA 598 

IIIIIIIIIIIIIIIIIIHIIIIIIIIIIIlllllllllllllllllllllllllllll 

734 CCAATCAGTGTACTATTGACAGGAATCGGAGAAAGAGCTGCCAGGCTTGCCGTCTTAGGA 793 
599 AGTGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGCCGTGTTTTGC 658 

IlllUllllllllllllllllimillllllllllllllllUIIIIIIIIIIIIIII 

794 AGTGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGCCGTGTTTTGC 853 
659 GCCGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGT 718

MIlllllllMIIIIIIMIIIIIllllllllMIMIIIIIIIIIMnillllllll 

854 GCCGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGT 913 
719 ACAAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTG 778 

illlllllM III lllll [III MMIMIIIIIIIIIIIIIIIIIIIIIM M 111 I II 

914 ACAAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTG 973 
779 GAGGAGGAAAATCATCAGTGACCGGGATGTCTCCTGACCAGGTGCTCCTCCTGCTCCAGG 838

i inn iiiiiii inn iiiiiii mm mi minimum mi 

974 GAGGAGGAAAATCATCAGTGACCGGGATGTCTCCTGACCAGGTGCTCCTCCTGCTCCAGG 1033 
839 GTGCCGAGCCCCCAATGCTGTGCTCCCGTCAGAAGCTGAGCCGACCGTACACCGAGGTCA 898 

794 AGTGTTATGAAGTGGGCATGATGAAAGGAGGTGTTCGCAAGGACCGTGGCCGTGTTTTGC 853 
659 GCCGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGT 718 

iiiiiimimmimmimiMimmiiiimiiiiiiiiiiiimi 

854 GCCGTGATAAACGACGTGCTGGAACCAATGACCGAGAGAAGGCCTCTAAGGACCTGGAGT 913 
719 ACAAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTG 778 

lllillllllllMMIIIIIlMMIIIIIIIIIIIIIIIIIIIIIIllllllllllll 

914 ACAAAACAGTGCCCCCTCAGGACAGGAGGAAACACAGCAGCAGCAGCAGTGCCGGTGGTGGTCAGAAGCTGAGCCG 
899 CCATAATGACACTACTCACCAGCATGGCCGATAAGGAGCTGGTCCACATGATCACCTGGG 958 

mmiiiiiiii iiiimmiimiMiiiiiiiiiiiiuimiii iib , 

1094 CCATAATGACACTACTCACCAGCATGGCCGATAAGGAGCTGGTCCACATGATCACCTGGG 1153 
959 CCAAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAGCTGCTGGAGA 1018 

imiiimiiiiiiiiimiiimi illinium mini 

1154 CCAAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAGCTGCTGGAGA 1213 
1019 GCTCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCA 1078 

uumiiumuumumuiiiimuuimimmiumuii 

1214 GCTCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCA 1273 
1079 AACTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCT 1138 

unuinmumuiiumuimummumuiiiiuuiuu 

1274 AACTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCT 1333 
1139 TTGTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATGCTCAAACTCA 1198 

nuuinimummuuinmimmmmuimmiimii 

1334 TTGTTGAGATCTTCGACATGCTGCTGGCCACTGCCTCCCGCTTCCGCATGCTCAAACTCA 1393 
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)y 1199 AACCTGAGGAGTTTGTCTGCCTCAAAGCTATCATCCTGCTCAACTCTGGTGCCTTCTCTT 1258 

imiiimiimiiiiiiiiiiiiiiiiiiiiMiMiiiimiiiiimiiiii 

>b 1394 AACCTGAGGAGTTTGTCTGCCTCAAAGCTATCATCCTGCTCAACTCTGGTGCCTTCTCTT 1453 

Jy 1154 CCAAGAAGCTTCCAGGTTTCCTGCAGCTGTCTCTCCATGACCAGGTGCAGCTGCTGGAGA 1213 

3y 1019 GCTCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCA 1078 

I I I 1 I I I I M I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
>b 1214 GCTCGTGGCTGGAGGTGCTGATGATTGGGCTCATATGGAGGTCCATCCACTGCCCCGGCA 1273 

3y • 1079 AACTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCT 1138 

MiiiiiiiiimmiiiiMiiiiiiiimmiiiiiimiiiiii in 

Db 1274 AACTCATCTTCGCACAGGACCTCATACTGGACAGGAATGAAGGTGACTGTGTGGAAGGCT 1333 

2y 1139 TTGTTGAGATCTTCGACATGCTGCTGG 1259 TCTGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1454 TCTGCACCGGCACAATGGAGCCCCTCCACAACAGCATGGCAGTGCAGAACATGCTGGACA 1513 

Qy 1319 CCATCACAGACGCTCTCATACATCATATCAGCCAATCAGGATGCTCGGCTCAGCAGCAGT 1378 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I M I I I I I I 
Db 1514 CCATCACAGACGCTCTCATACATCATATCAGCCAATCAGGATGCTCGGCTCAGCAGCAGT 1573 

Qy 1379 CGAGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAG 1438 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1574 CGAGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAG 1633 

Qy 1439 GCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGC 1498 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1634 GCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGC 1693 

Qy 1499 TGGAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAGTTCTGGTCCC 1558 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1694 TGGAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAGTTCTGGTCCC 1753 

Qy 1559 AGGCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGTGGCAGCAATG 1618 

II II 1 1 1 !l I II I II Mill IMIM 1 1 1 !;:'!: II llllll I IN III III I! 1 1 M 

Db 1754 AGGCTGACGGAGAGCCTCCCTTCATTAACAACAACAACAGCAGCAACAGTGGCAGCAATG 1813 

Qy 1619 GCGGCGTCTCCTCTTCAGTCGGTTCCAGTTCAGGACAGGATGCTCGGCTCAGCAGCAGT 1573 

Qy 1379 CGAGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAG 1438 

llllllllllillllllllllMIIIIIIIMIllllMIIIIIMMIIIIIIIIIIII 

Db 1574 CGAGGCGGCAGGCCCAGCTGCTGCTCCTGCTCTCCCACATCAGGCACATGAGCAACAAAG 1633 

Qy 1439 GCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGC 1498 

III Illlllllllllilllllll Illlllllllllllllllllllllll 

Db 1634 GCATGGAGCATCTCTACAGCATGAAGTGCAAGAACAAAGTGCCTCTTTACGACCTTCTGC 1693 

Qy 1499 TGGAGATGTTGGACGCTCACCGTATACACCGCCCAGACAGACCAGCTCAGTTCTGGTCCC 1558

CCCCGAGTCAACCACGAGAGCCCGA 1678 

1814 GCGGCGTCTCCTciicA^ 1873 

Qy 1679 GCAGAGGACCCACAGGTCCAGGAGTCCTGCAGTACGGAGGGTCCCGCTCTGACTGCACCC 1738 

nun i iiiiiiii iii iiiiiiiiiiiiiiiiiiii miii mi i ii i iii mi ii 

Db 1874 GCAGAGGACCCACAGGTCCAGGAGTCCTGCAGTACGGAGGGTCCCGCTCTGACTGCACCC 1933 

Qy 1739 ACATCCTA 1746 

Illlllll 
Db 1934 ACATCCTA 1941 

RESULT 3 
US-10-501-227-3 

; Sequence 3, Application US/10501227
; Publication No. US20060141560A1
GENERAL INFORMATION: 
APPLICANT: SAITO, Koichi 

TITLE OF INVENTION: Estrogen receptor genes and utilization thereof 
FILE REFERENCE: 600630-21US (561334) 
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Ligand-binding domain of estrogen receptors 

Anke Mueller-Fahrnow* and Ursula Egner†



Estrogen receptors are multi-domain proteins that interact with 
other proteins and DNA to fulfil their function: the regulation of 
transcription. During the past 2-3 years, our understanding of 
this complex process has increased tremendously as crystal 
structures of isolated ligand-binding domains in complex with 
various ligands, as well as co-activator peptides, are now 
available. The structural information, combined with new data 
on novel co-activators/co-repressors, muteins and their 
actions, and novel ligands, allows for the first time the 
development of detailed theories for the first steps of 
transcription initiation. 
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Abbreviations 

DES diethylstilbestrol 

E2 estradiol 

ER estrogen receptor 

LBD ligand-binding domain 

LBP ligand-binding pocket 

NR nuclear receptor 

RXR retinoic X receptor

Introduction 

The estrogen receptor (ER) belongs to the superfamily of 
nuclear receptors (NRs). It regulates the expression of spe- 
cific genes and thus plays a crucial role in many processes, 
such as the control of reproduction and the development of 
secondary sexual characteristics [1,2]. It has therefore been 
a target for research within pharmaceutical companies for 
many years. As a result, thousands of steroidal and non- 
steroidal ligands to the ER have been synthesized, which 
act either as agonists, partial agonists or pure antagonists. 

NRs are built up of five to six domains serving different
functions. Out of these, the ligand-binding domain (LBD;
E-domain) is of special interest not only due to its involvement
in ligand binding but also in receptor dimerization
and in the transactivation process: the carboxy-terminal
region of the LBD comprises the transactivation function
AF-2 [3*]. Until recently, the structural basis for these
functions was only poorly understood, but during the
past few years crystal structures for the human (h)ERα
LBD [4,5,6**] and hERβ LBD [7], as well as for a number
of other NR LBDs, have been determined [8-15]. With
these structures at hand, the differences and similarities
between hERα LBDs (Egner U et al., unpublished data)
and the recently identified hERβ [7,16] can be analyzed.



Although the availability of the structures has contributed 
to our understanding of ER actions, a number of open 
questions remain, especially concerning the conformation- 
al changes that occur upon ligand binding (described 
below). In this review, we give an overview of new data on 
estrogen receptor LBDs with a special emphasis on struc- 
tural aspects. We try to outline the implications of the 
structure on our understanding of the protein function and 
point out areas for future research. 

Homology with other nuclear receptors 

With respect to sequence homology, the superfamily of
NRs can be divided into several subfamilies. The ER subfamily
comprises ERα and ERβ, which share ~55%
sequence identity in their LBDs (two proteins are expected
to share a similar fold if the sequence identity is >30%,
and if the identity is as high as 50% the three-dimensional
structures can be assumed to be very similar), and the
estrogen-receptor-related orphan receptors ERR1 and
ERR2 [17]. On the mRNA level, different splice variants
have been identified for ERs, but the corresponding proteins
have not been detected [18-23].
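
The identity thresholds quoted above can be applied to any pair of aligned sequences. The following is a minimal sketch, assuming the two sequences are already aligned to equal length with "-" as the gap character and that gapped columns are excluded from the count; other conventions for the denominator exist. The function names and the toy sequences are hypothetical and are not ER sequences.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over aligned, ungapped columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(seq_a, seq_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


def expected_structural_relation(identity: float) -> str:
    """Apply the rule of thumb from the text (>30%: similar fold; >=50%: very similar)."""
    if identity >= 50.0:
        return "three-dimensional structures expected to be very similar"
    if identity > 30.0:
        return "similar fold expected"
    return "no expectation from sequence identity alone"


# Toy aligned sequences, for illustration only.
toy_identity = percent_identity("ACDEFGHIKL", "ACDQFGHLKL")
print(toy_identity, expected_structural_relation(toy_identity))
print(expected_structural_relation(55.0))  # the ~55% quoted for the ERα and ERβ LBDs
```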

In the June 1999 version of the SWISS-PROT/TrEMBL 
database [24], around 30 ER LBD sequences from different 
species were found, omitting those with only partially deter- 
mined sequences. Based on a multiple sequence alignment, 
a phylogenetic tree was reconstructed (Figure 1). From this 
presentation, it is obvious that the interspecies homology for 
the LBDs is much higher than the similarity between the α
and β receptor subtypes. It should also be mentioned that
for both ER homologues, LBDs from fish and vertebrates 
can be clearly distinguished with respect to their sequence 
homologies (Figure 1). 

The distribution of conserved residues on the surface of
the hERα LBD is shown in Figure 2a. Highly conserved
residues within the ER subfamily are indicated by yellow
colouring of the protein, which is shown in the same
orientation as in Figure 3. A striking feature of the ER
subfamily is that most of the conserved residues within
the LBD are located in one area of the protein, close to
one assumed co-activator binding site (see below). A
similar analysis for the progesterone receptor subfamily
is given in Figure 2b. The human proteins belonging to
this subgroup (progesterone, glucocorticoid, androgen
and mineralocorticoid receptors) share ~50% sequence
identity within their LBDs, but conserved residues are
more evenly distributed (Figure 2b). The relevance of
this observation is not clear. We know, however, that
LBDs function via conformational changes that occur
upon ligand binding and by interactions with other ER
domains and with co-activators/co-repressors. It can
therefore be speculated that a highly conserved region
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Phylogenetic reconstruction of the ER LBD 
sequences (amino acid range as shown in 
Figure 4) according to the neighbor-joining 
method as implemented in CLUSTALW V1.7 
[49]. Accession numbers referring to the
SWISS-PROT/TrEMBL database [24] are
given. The ERR1 and ERR2 sequences are 
defined as outgroup. 



P81559 ERα Xenopus laevis
Q9YHT3 ER Anolis carolinensis
AAD17316 ERα Equus caballus
Q29040 ERα Sus scrofa
P19785 ERα Mus musculus
P06211 ERα Rattus norvegicus
P03372 ERα Homo sapiens
Q13511 ERα Homo sapiens 80 kDa
P06212 ERα Gallus gallus
Q91250 ERα Poephila guttata
Q9YHZ7 ERα Ictalurus punctatus
P50241 ER Oryzias latipes
P50240 ER Oreochromis aureus
Q9YH33 ERα Oreochromis niloticus
Q42132 ER Pagrus major
AAD31032 ERα Sparus aurata
P50242 ER Salmo salar
P16058 ER Salmo irideus
Q9YGE6 ER Oncorhynchus mykiss
AAD24432 ERβ Bos taurus
Q92731 ERβ Homo sapiens
O08537 ERβ Mus musculus
Q62986 ERβ Rattus norvegicus
O13012 ER Anguilla japonica
AAD26921 ERβ C
AAD31033 ERβ Sparus al
P11474 ERR1 Homo sap
P11473 ERR2 Homo sap



on the protein surface is indicative of a conserved site for 
protein-protein interactions. 

The three-dimensional structure 

For most steroid receptors, as well as for a number of related
nuclear receptors, crystal structures of LBDs have been
determined during the past few years; however, not all of
them have been published. In addition to the experimentally
determined structures, a number of models have been
published based on related structures and on a number of
additional supporting biological and binding data [25,26*].
An overview of the ER LBD structures and of some relevant
models is given in Table 1. The overall structures of
the hERα and hERβ LBD-raloxifene complexes are very
similar: the root-mean-square deviation between equivalent
Cα atoms (excluding helix H12) is 0.97 Å [7].
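
The root-mean-square deviation quoted above is a simple function of paired atom coordinates. The sketch below is a minimal illustration, assuming the two structures have already been superposed and that the Cα atoms are supplied as equally long lists of (x, y, z) tuples in Å in matching order. It does not perform the optimal superposition itself, and the function name and toy coordinates are hypothetical.

```python
import math
from typing import Sequence, Tuple

Coordinate = Tuple[float, float, float]


def rmsd(coords_a: Sequence[Coordinate], coords_b: Sequence[Coordinate]) -> float:
    """Root-mean-square deviation between equivalent, pre-superposed atoms."""
    if len(coords_a) != len(coords_b):
        raise ValueError("both structures must supply the same number of atoms")
    if not coords_a:
        raise ValueError("at least one atom pair is required")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared Euclidean distance between one pair of equivalent atoms.
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Toy coordinates: every atom of the second set is shifted by 1 Å along z.
structure_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
structure_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (2.0, 0.0, 1.0)]
print(rmsd(structure_a, structure_b))
```

Excluding a region such as helix H12 amounts to leaving its atoms out of both input lists before the call.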

All known NR LBD structures consist of 11-12 α-helices
and one small β-sheet. The secondary structure elements
are arranged in a so-called α-helical sandwich, a protein fold
that has up to now only been observed for NR LBDs. This
fold is shown as a cartoon in Figure 3 together with the numbering
of some of the helices. Helix H2, which has been
observed, for example, in the retinoic X receptor (RXR)
LBD structure [8], is absent in the hERα LBD. The mainly
hydrophobic binding niche, which is described below in
more detail, is located in the lower part of the protein with
respect to the orientation of the protein shown in Figure 3.

A detailed comparison with the structures of other NR
LBDs shows that the upper part of the protein (with
respect to the orientation shown in Figure 3) is structurally
much better conserved than the lower part harbouring
the ligand-binding pocket (LBP) (for details see [3*];
Egner U et al., unpublished data). This can be rationalized
as the natural ligands differ considerably in size and
shape; the structural variability can thus be attributed to
the affinity towards diverse ligands.
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Connolly surface of the hERa LBD in complex with (a) estradiol 
compared to the (b) progesterone receptor LBD. The orientation of the
proteins is the same as in Figure 3. Coloring is based on residue
conservation: yellow, highly conserved residues (> 80% identity); blue,
homologous residues (> 80% similarity); green, no sequence
conservation. The red arrow indicates the position of helix H12, the tip
of the arrow being the carboxy-terminus of the helix. 



The ligand-binding pocket 

ERs bind the endogenous ligand estradiol with a very high
affinity of 0.05 nM for hERα and 0.09 nM for hERβ,
respectively [27]. The protein-ligand interactions have
been analyzed and described in detail [28-31,32*]. The
otherwise hydrophobic LBP, which is not accessible from
the protein surface in the agonist-bound state (see below),
harbours two polar regions located at opposite ends of the
cavity. At the deep end of the pocket, Glu353 and Arg394
(hERα numbering) serve as anchoring points for the
3-hydroxy group of estradiol (E2). In addition, a conserved
water molecule stabilizes this interaction. This hydrogen
bond network explains the predominant contribution of
the 3-hydroxy group to the binding affinity of the steroid
[25]. Similar hydrogen bonds are observed in all available
hERα and hERβ LBD structures.
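
To put the quoted affinities on an energy scale, they can be converted to a standard binding free energy. The sketch below is an illustration under stated assumptions: it treats the quoted values as equilibrium dissociation constants, uses a temperature of 298.15 K and a 1 M standard state, and applies ΔG° = RT ln(Kd). None of these conditions is given in the text, and the function name is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618  # J/(mol*K)


def binding_free_energy_kj(kd_molar: float, temperature_k: float = 298.15) -> float:
    """Standard binding free energy in kJ/mol from a dissociation constant in mol/L."""
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    return GAS_CONSTANT * temperature_k * math.log(kd_molar) / 1000.0


# Affinities quoted in the text, assumed here to be dissociation constants.
dg_alpha = binding_free_energy_kj(0.05e-9)  # hERα, 0.05 nM
dg_beta = binding_free_energy_kj(0.09e-9)   # hERβ, 0.09 nM

print(f"hERα: {dg_alpha:.1f} kJ/mol")
print(f"hERβ: {dg_beta:.1f} kJ/mol")
print(f"difference: {dg_beta - dg_alpha:.1f} kJ/mol")
```

Under these assumptions the less than twofold difference in affinity between the two subtypes corresponds to a free-energy difference of well under 2 kJ/mol.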

The 17-hydroxy group of E2 interacts with the second 
polar region within the LBP, that is, with His524 (hERα
numbering). Different sidechain conformations for this 
residue are observed in the various complex structures 
known to date. Except for these two polar regions, the 
LBP is lined by hydrophobic residues. The fact that some 
are flexible (e.g. methionine and isoleucine) accounts for 
the high affinity of a variety of steroidal and non-steroidal 
ligands, including phytoestrogens such as genistein [7,27]. 

As the estrogen receptors are of such high medical relevance, 
thousands of ligands have been synthesized within research 
institutions and in the pharmaceutical industry. With respect 
to their functions, the ligands can be classified as agonists 
(e.g. E2 [32*]), partial agonists (e.g. raloxifene [28]) and as
pure antagonists [30,32*]. Up to now, structures are only 
available for the first two groups, so that the structural basis 












Cartoon [50] of the hERα LBD in complex with (a) the endogenous
ligand estradiol and (b) the partial agonist raloxifene. The helices are 
represented as barrels. For some of them, the numbering is given. 



for the differences between partial agonists and pure antago- 
nists is thus not yet understood. 

Conformational changes 

For the hERα LBD, seven crystal structures are available.
The most striking difference between these structures is
the orientation of helix H12. A comparison with related
structures ([4,6**]; Ruff M et al., personal communication)
indicates that the position of this helix depends on the
presence of a ligand. In the only available apo form of an NR
LBD, the RXR LBD, this helix points away from the protein
core [8]. In most of the available agonist-bound LBDs,
however, the helix is folded back and serves as a lid closing
the entrance to the LBP. At the same time, AF-2 is
brought into a position where co-activators can bind, resulting
in transcription initiation. In the case of the hERα
LBD, helix H12 is anchored in this 'agonist conformation'
by hydrophobic interactions as well as by hydrogen bonds.

The more space-consuming ER partial agonists with their
long-chain substituents cannot be accommodated in the
LBP in the 'agonist conformation' of the protein. It was
therefore expected that upon partial agonist binding, helix
H12 would be present in another orientation. The two different
orientations of helix H12 induced by the agonist E2
and the partial agonist raloxifene are depicted in Figure 3,
showing that this is the case. Recently, the crystal structure
has been determined for the ternary complex of the hERα
LBD with the agonist diethylstilbestrol (DES) and a peptide
of the co-activator GRIP-1 [6**]. The position of the
α-helical peptide almost coincides with helix H12 in the
antagonist conformation of the protein: upon superposition,
the distance between equivalent Cα positions varies
between 0.6 and 2.2 Å. This observation supports the experimental
finding that antagonists block co-activator binding.

In contrast to expectation, two structures of the hERα
LBD in complex with E2 are presently known where helix
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Figure 4 

Sequence alignment of hERα, mouse ERα
and hERβ LBDs. Positions of mutated
residues are marked by an (x) below (hERa) 
or above (mouse ERa) the sequences. 
Residues belonging to helices in the hERa 
LBD-estradiol complex are indicated by a 
gray background; residues in contact with 
ligands are printed bold face; the numbers of 
the helices are given. 




I LSHISHMSNK GMEHLYNMKC KNWPLYDLL LEMLDAH SIX 5 
399 SMRLANLU4L LSHVRHASNK GMEHLLNMKC KHWPVYBLL LEMLNAH VLR 44B 



H12 is not in the agonist conformation. In the artificial
tetramer crystallized by Tannenbaum et al. [5], this helix
interacts with a neighboring LBD molecule, and in the
recently crystallized triple mutant (Ruff M et al., personal
communication), the helix is in the 'partial agonist conformation'.
The meaning of this structural variability is not yet
fully understood but might be caused by the absence of
the carboxy-terminal F-domain (adjacent to helix H12) of
the hER in all crystal structures. There is no experimental
evidence that the structural variability observed in the
structures is caused by the absence of the F-domain. It cannot,
however, be excluded that an additional domain has a
stabilizing effect on the three-dimensional structure.

Differences between receptor subtypes 

The tissue distribution of hERα and hERβ was investigated
shortly after the discovery of the second isoform [16].
hERβ is the predominant subtype in bone, blood vessels
and in the brain, whereas its content in uterus and liver is
only low. Because of the different distribution in the various
target organs, the medical interest in subtype-specific
ligands is high.

Both human receptors share ~55% identical amino acid
residues in their LBDs. The analysis of the LBPs shows that
most residues in direct contact with E2 in the hERα LBD
are conserved in hERβ. The same is true for residues in the
second shell around the ligand. There are a number of differences,
however, in the vicinity of the steroidal A-ring
close to position 4 of estradiol. In this area, six homologous
replacements are encountered (e.g. Ile→Leu, Met or Val).
Another two homologous exchanges are present in the environment
of the steroidal D-ring. In spite of the high degree
of conservation between the LBPs, an ERα subtype-specific
ligand exhibiting a more than 100-fold difference in
binding affinity has recently been published [31].

Dimerization and binding of co-activators/ 
co-repressors 

Receptor dimerization is a prerequisite for transcription initiation
in ER-regulated processes. As LBDs contribute to the
dimerization interface, it was expected that the hERα-E2
complex would crystallize as a dimer, as is the case. There is
evidence that the interactions observed in the crystals reflect
the natural dimerization contacts: firstly, all published point
mutants influencing ER dimerization are in, or at least close
to, a region involved in crystal contact; and secondly, dimers
have been observed in crystal structures of related LBDs
([4,6**]; Ruff M et al., personal communication). The topology
of the dimer interfaces is comparable in all these
structures, especially concerning the key role of helix H10 in
the intermolecular contacts. In the case of the hERα-E2
complex, the monomer-monomer interactions are very
strong: 27 amino acid residues contribute to the large contact
surface and out of these 14 are involved in hydrogen bonds.

In recent years, several novel classes of transcriptional 
mediators (co-activators and co-repressors) that interact 
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<sof hERaand hERpLBDs, 



Ligand/co-activator peptide



Partial agonist/ Co-adivator 



LBD carboxy-methylated [6~] DES 
Wild-type LBD W Estradiol 
LBD carboxy-methylated [4] Estradiol 



GRIP-1: 686-696 



LBD carboxy-methylated [4 



Estradiol and others 



hERpLBD [7] 

LBD carboxy-methylated Genistein 



(a) M Ruff, M Qangloff, M Eiler, S Duclaud, JM Wurtz, D Moras, unpublished data. DES, diethylstilbestrol. 



with LBDs have been identified and characterized 
[33-35,36* ,37-39], It should be mentioned here that some 
of these mediators, such as NSD1 and GRIP-1, are ligand 
dependent [33,36*], whereas for others, such as SRC-1, a 
hormone-independent mechanism is discussed [39]. The 
recently elucidated structure of hERcc LBD in complex 
with a GRIP-1 peptide has already been mentioned above 
[6**]; however, a comparable complex structure with a hor- 
mone-independent mediator is not yet available. 

Mutations 

In contrast to, for example, the androgen receptor, only 
one germline mutation is known for the hER, which led to 
osteoporosis in a young man [40]. A number of splice vari- 
ants have been identified on the mRNA level and their 
biological role remains to be elucidated. Furthermore, 
some point mutations have been identified in patients 
with primary breast cancer [41]. 

For a better understanding of the functional role of different
residues, a number of engineered mutants of the hERα
LBD have been investigated ([26*] and references therein,
[41-48]), but only four mutants for hERβ LBD have yet
been published. Positions of mutated residues are indicated
in the sequence alignment shown in Figure 4. For a number
of these mutants, the influence on ligand binding has been
analyzed in detail (e.g. [43], and references in [26*]). For the
vast majority of them the influence on the binding activity
could be understood on the basis of the hERα LBD
structure. In the more recent literature, the effect of muteins on
the transcription potency and on the interaction with
co-activators/co-repressors is also described (e.g. [44,46]).

Conclusions 

During the past few years, the progress in our understanding
of the hERα LBD actions has been tremendous.
Several scientific areas have contributed to this achievement.
Firstly, after more than one decade of intensive
work, crystal structures of hERα and hERβ LBD complexes,
as well as of related NR LBDs, are now available.
Secondly, novel classes of co-activators and co-repressors
have been identified and characterized, which have been
mentioned only briefly in this review. Thirdly, new
steroidal and non-steroidal ligands to the hERs have been
synthesized. A special emphasis has been on the search for
pure antagonists, which are considered to have an
improved clinical profile as compared to the commonly
used partial agonists, such as tamoxifen. Finally, the effect
of engineered mutants has been studied in detail and
supported our understanding of the role of single residues
within the protein subunit.

In spite of the progress outlined above, many open questions
and uncertainties remain concerning the ER LBD
actions. To give one example, no crystal structure is available
for a complex with a pure antagonist, that is, the
structural basis for the functional differences between
partial agonists and pure antagonists is unknown.
Furthermore, the role of helix H12 and its different
orientations has in part been deduced from crystal structures
in which the domain adjacent to helix H12, the F-domain,
is missing. The conformational variability of this helix
might thus be an artefact depending upon the choice of
the protein fragment used for the crystallization trials.

If the number of new ER LBD structures further increas- 
es at its current rate and if the information about the 
co-activators and co-repressors continues to contribute 
equally fast to our understanding of ER actions, we will 
soon be able to form a realistic picture of the early stages
of transcriptional activation. 